A Study of Quality and Accuracy Trade-offs in Process Mining

نویسندگان

  • Zan Huang
  • Akhil Kumar
چکیده

The goal of process mining is to extract semantic knowledge from a log consisting of process execution traces for the purposes of process understanding, innovation and improvement. In recent years many algorithms have been proposed to extract process models from logs. The process models describe the ordering relationships between tasks in a process in terms of standard constructs like sequence, parallel, choice and loop. Most algorithms assume that each trace in a log represents a correct execution sequence based on a model. In practice, logs are noisy and algorithms designed for correct logs are not able to handle noisy logs. In this paper we share our key insights from a study of noise in process logs both real and synthetic. Our first finding is that all process logs can be explained by using self-loop and optional structures. Therefore, it is not difficult to build a fully accurate process model for any given log, even logs that contain inaccurate data or noise. Secondly, there is usually not one single, unique process model that can explain a log, i.e. the same log can be explained by a large number of different models. Thirdly, some models are of higher "quality" than others, and given that so many models can explain the same log, it is important to have a metric of quality for a model. Fourth, if a log contains noisy execution traces, a fully accurate process model that explains every trace in the log is not very meaningful because its quality is low. By controlling the use of self-loop and optional structures around tasks and blocks of tasks we can balance the quality and accuracy tradeoff to derive high-quality process models that explains a given percentage of traces in the log. Finally, we describe a novel quality-based algorithm for model extraction in the context of our noisy logs. The results of the experiments with the algorithm on real and synthetic data are reported and analyzed at length.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Defining Pathways and Trade-offs Toward Universal Health Coverage; Comment on “Ethical Perspective: Five Unacceptable Trade-offs on the Path to Universal Health Coverage”

The World Health Organization’s (WHO’s) World Health Report 2010, “Health systems financing, the path to universal coverage,” promoted universal health coverage (UHC) as an aspirational objective for country health systems. Yet, in addition to the dimensions of services and coverage, distribution of coverage in the population, and financial risk protection highlighted by the report, the conside...

متن کامل

Universal Health Coverage – The Critical Importance of Global Solidarity and Good Governance; Comment on “Ethical Perspective: Five Unacceptable Trade-offs on the Path to Universal Health Coverage”

This article provides a commentary to Ole Norheim’ s editorial entitled “Ethical perspective: Five unacceptable trade-offs on the path to universal health coverage.” It reinforces its message that an inclusive, participatory process is essential for ethical decision-making and underlines the crucial importance of good governance in setting fair priorities in healthcare. Solidarity on both natio...

متن کامل

Ethical Perspective: Five Unacceptable Trade-offs on the Path to Universal Health Coverage

This article discusses what ethicists have called “unacceptable trade-offs” in health policy choices related to universal health coverage (UHC). Since the fiscal space is constrained, trade-offs need to be made. But some trade-offs are unacceptable on the path to universal coverage. Unacceptable choices include, among other examples from low-income countries, to expand coverage for services wit...

متن کامل

Coronavirus: Where Has All the Health Economics Gone?

As the coronavirus disease 2019 (COVID-19) pandemic continues to unfold there is an untold number of trade-offs being made in every country around the globe. The experience in the United Kingdom and Canada to date has not seen much uptake of health economics methods. We provide some thoughts on how this could take place, specifically in three areas. Firstly, this can involve understanding the i...

متن کامل

Application of Hazard Based Model for Housing Location Based on Travel Distance to Work

Residential location choice modeling is one of the areas in transportation planning that attempts to examine households location search behavior incorporating their trade-offs between housing quality, prices or rents, distance to work and other key factors. This brings up the need to come up with methods to logically allocate credible choice alternatives for individuals.This article attempts to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • INFORMS Journal on Computing

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2012